Similarity Measures for Short Queries
Authors
Abstract
Ad-hoc queries are usually short, of perhaps two to ten terms. However, in previous rounds of TREC we have concentrated on obtaining optimal performance for the long TREC topics. In this paper we investigate the behaviour of similarity measures on short queries, and show experimentally that two successful measures, which give similar, good performance on long TREC topics, do not work well for short queries. We explore methods for achieving greater effectiveness for short queries, and conclude that a successful approach is to combine these similarity measures with other evidence. We also briefly describe our experiments with the Spanish data.
Similar resources
Presentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures
Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...
A Web-based Kernel Function for Matching Short Text Snippets
Determining the similarity of short text snippets, such as search queries, works poorly with traditional document similarity measures (e.g., cosine), since there are often few, if any, terms in common between two short text snippets. We address this problem by introducing a novel method for measuring the similarity between short text snippets (even those without any overlapping terms) by levera...
Similarity Measures for Short Segments of Text
Measuring the similarity between documents and queries has been extensively studied in information retrieval. However, there are a growing number of tasks that require computing the similarity between two very short segments of text. These tasks include query reformulation, sponsored search, and image retrieval. Standard text similarity measures perform poorly on such tasks because of data spar...
Which of the following SPARQL Queries are Similar? Why?
Linked data on the Web can be accessed by SPARQL queries. Previously executed queries to SPARQL endpoints are valuable information sources about the underlying data structure and schema of data sources. These queries reveal how resources are related to each other and they reflect the user interests on the data. Therefore, methods for query log analysis provide a basis for extracting relevant ...